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Abstract: A model of growth of icosahedral viral capsids is proposed. It 
takes into account the diversity of hexamers' compositions, leading to definite 
capsid size. We show that the observed yield of capsid production implies 
a very high level of self-organization of elementary building blocks. The 
exact number of different protein dimers composing hexamers is related to 
the size of a given capsid, labeled by its T-number. Simple rules determining 
these numbers for each value of T are deduced and certain consequences are 
discussed. 
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1 Introduction 

Just like the world of planets and stars, the world of viruses is ruled by 
numbers. This is particularly true in the case of the numerous group of 
spherical viruses, whose protective protein shells called "capsids" display 
perfect icosahedral symmetry pQ . It is amazing that these structures, known 
to mathematicians since Coxeter's classification [2|, are also observed in the 
so-called fullerenes, huge molecules composed exclusively of carbon atoms, 
predicted by Smalley and Kroto, and discovered in the eighties. 
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Since Caspar and Klug j3] introduced simple rules predicting a sequence of 
observed viral capsids, several models of growth dynamics of these structures 
have been proposed, e.g. A. Zlotnick's model [1] published in 1994. 

The common geometrical feature of many viral capsids and fullerenes is 
their icosahedral shape, with twelve pentagons found on the opposite sides 
of six five-fold symmetry axes, and an appropriate number of hexagons in 
between. The number of hexagons is given by the following simple formula: 
N$ = 10 (T — 1), with T = (p 2 + pq + q 2 ), called the triangular number, and 
where p and q are two non-negative integers [2|. 

In capsids, the building blocks made of coat proteins are called monomers, 
dimers, trimers, pentamers and hexamers, according to their shape, the big- 
ger ones usually being assembled from smaller ones prior to further agglom- 
eration into capsid shells |5j . Sometimes pentameric or hexameric symmetry 
is displayed despite the direct construction from 60 or 180 smaller subunits, 
like in the Cowpea mosaic virus and the Cowpea chlorotic mottle virus, re- 
spectively [5] Although certain virus species grow medium-size capsids cor- 
responding to iV 6 = 20 (like in the C G0 fullerene molecule), or N 6 = 30 and 
Nq = 60, some of them form pure dodecahedral capsids (with exclusively 
pentamers as building blocks), like certain Comoviridae or Cowpea virus [7j), 
while some others, like human Adenovirus jHj, form very huge capsids with 
N e = 240, corresponding to p = 5, q = 

In some cases, the similarity with the fullerene structure is striking: for 
example, the TRSV capsid is composed of 60 copies of a single capsid protein 
(56 000 Da, 513 amino acid residues) [TU], which can be put in a one-to-one 
correspondence with 60 carbon atoms forming a fullerene Cqo molecule; the 
aforementioned Cowpea viruses provide another example of the same type. 

The process of building the icosahedral viral capsids differs quite essen- 
tially from the fullerene formation: fullerenes are formed from carbon atoms 
and small carbon molecules like G%,C^, up to Cg or C±o), etc., in a hot plasma 
around electric arc between two graphine electrodes, whereas capsids are built 
progressively in liquid medium, from agglomerates of giant protein molecules 
displaying pentagonal or hexagonal symmetry, or directly from smaller units 
(monomers or dimers). It also seems that there is no such thing as universal 
assembly kinetics: the way the capsids are assembled differs from one virus 
to another. The T = 7 phage HK47 appears to build pentamers and hexam- 
ers first, then assemble these capsomers to form the final capsid structure, 
whereas another T = 7 phage labeled P22 appears to assemble its capsids 
directly from individual coat proteins (see |9j) and the references within). 
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The common point is the presence of pentagons and hexagons in the 
resulting structure, and the strict topological rules that result from Euler's 
theorem on convex polyhedra: V — E + F = 2, with V number of vertices, 
E number of edges, and F number of faces. From this one derives the fact 
that when only pentagonal and hexagonal faces are allowed, the number of 
pentagons is always N 5 = 12, while the hexagon number is iV 6 = 10(T — 1). 

Contrary to the case of fullerene molecules, whose yield from the hot 
plasma is in the best case no higher than 10% of total mass of carbon sooth, 
viruses use almost 100% of pentamers and hexamers at their disposal to form 
perfect icosahedral capsid structures, into which their DNA genetic material 
is densely packed once the capsid is complete. 

This means that the initial nucleation ratio of pentamers versus hexam- 
ers is very close to its final value in capsids in order to minimize the waste. 
Secondly, the final size of the capsid must depend on particular assembly 
rules, which can be fairly well deduced from the statistical weights of various 
agglomeration steps, found by maximizing the final production rate. Let us 
investigate the rules that define the type and the size of capsids, simultane- 
ously optimizing the production rate. 

2 Probabilistic analysis of agglomeration 

The simplest stochastic model of growth successfully applied to fullerene for- 
mation JI] is based on the assumption that the dominant agglomeration 
processes consist of forming new polygons in the cavities between two poly- 
gons on the border of the existing cluster, by adjoining a C2 or a C3 molecule 
to a cavity found in a cluster already formed. One of such processes is shown 
in FigQ below. 

It is clear that the resulting (666) cluster is wasted for further fullerene 
formation, whereas a (656) cluster can be used in the next agglomeration step. 
Because the (666) clusters are also absent in final fullerene cage, it is easy 
to see that at each of consecutive agglomeration steps the yield of "proper" 
clusters, useful for further fullerene construction, is exactly 1/2. After about 
23 to 24 steps leading from the initial three-polygon structure to an almost 
finished fullerene cage with 27 to 28 (out of total of 32) polygons already in 
place, the total yield would approach 2 -24 ~ 10~ 8 instead of observed 10 _1 , 
i.e. 10% ! This means that there is a mechanism that favours the creation 
of "correct" structures versus the "wrong" ones, so that the average yield of 
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Figure 1: Creation of a new polygon in a cavity between two polygons 

clusters proper for further fullerene construction becomes close to q — 0.957 
at each agglomeration step, ensuring q 2A of order of 10 _1 . 

In the case of fullerenes, the correction is due to the Boltzmann-Gibbs 
factors reflecting the energy differences between four basic processes : creat- 
ing a new pentagon in a (6, 6) cavity, or creating a new hexagon in a (5, 6) or 
in a (6, 6) cavity, assuming that the energy barrier against creation of two or 
three pentagons sticking together is so big that the corresponding Boltzmann 
factor is close to 0. These factors could be evaluated by requiring that the 
successive probabilities of finding pentagons among all polygons in clusters 
of given size (after an n-th agglomeration step) and the corresponding yields 
form a geometric progression jTT] 

In the case of the icosahedral capsid formation the building process is not 
random at all. One can be convinced that a high degree of self-organization 
is involved by considering what would happen if even a small amount of 
randomness was present. Let us exclude from our considerations the capsids 
formed exclusively by pentamers; i.e. pure dodecahedral structures, and 
look at the build-up of bigger capsids involving twelve pentamers and the 
necessary number of hexamers. 

Let us denote the concentration (or the nucleation rate) of pentamers by 
x, that of hexamers by (1 — x). Then the probabilities of doublets are readily 
calculated as follows: 



where Wjk, j, k = 5, 6 are the statistical weights depending on the virus type 
and on the chemical barriers between various sides, and 




(1) 



Q = 2-W 56 x(l-x) + W m (l-x) 2 
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Figure 2: Adding a pentamer or a hexamer to an existing doublet 

is the normalizing factor. Note that we exclude two pentamers coming to- 
gether, i.e. W55 = 0. Similarly, the probabilities of three admissible "triplets" 
in the next agglomeration step displayed in FigJ3]are given by: 



566 



P 56 + 2 • P 66 W 66 , 5 x/Q 2 , P 666 = 2 • P 66 W mfi (1 - x)/Q 2 , (2) 



where Wqq^ and 1^66,6 denote the statistical weights of corresponding agglom- 
eration processes, and Q2 = Wqq^x + Wqq^ (1 — x). Now we can evaluate the 
average pentamer rate x^ in clusters of given size, after fc-th agglomeration 
step. The first three values are given by: 

= ^P 56 , X (2) = 7^656, ^ (3) = ^(2P 56 56 + Pi566 + Pjxk), -.etc 

(In the expression for x^ we use the probabilities for three different al- 
lowed clusters, which are not discussed in detail here, but can be quite easily 
obtained using the appropriate statistical weights and Boltzmann-Gibbs fac- 
tors). 

We can use these formulae in two different ways. Either we impose the 
statistical weights Wij and W^k, and determine the consecutive pentamer 
concentrations in growing clusters, starting from a given initial concentration 
x; or treat the statistical weights as unknowns and determine them from self- 
similarity equations for successive pentamer concentrations: 



„(n+2) _ ™(n+l) „(n+l) _ „(n) 

%AJ JU JU \Xj 



n = 2,3,4... (3) 
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Figure 3: Next agglomeration step: from triplets to quadruplets 

The resulting solutions for the limit values of x and for auxiliary variables 
i = (W 5e )/(W m ), n = (W 56 ,6)/(W 66)5 ), C = (WW)/(WW), etc., although 
usually not in the form of simple fractions, give very good hints concerning 
the assembly rules leading to particular capsid structures. 

Now, as the capsid production rate from initial protein material is close 
to 100%, for fullerene-like (T = 3) capsids we should have the initial rate of 
pentamers versus all capsomers as 3 : 8. In order to keep the same ratio in 
the grown-up capsids, simple "sticking rules" will suffice. 

Let us denote pentamers' sides by symbol p, whereas two different kinds 
of sides on hexamers' edges will be called a and b ( Fig. EJ). Suppose that 
a hexamer can stick to a pentamer with only (p + a)-combination; then 
two hexamers must stick to each other only through a (b + b) combination, 
with both (p + b) and (a + b) combinations being forbidden by chemical 
potential barrier. With these assumptions we get the following statistical 
factors: W m = 15, W m = 9 W 5efi = 3, W 66 , 5 = 5 and W 66fi = 0. With 
these rules the statistics in clusters will converge to the final value x = 3/8 
as shown in Fig0] below: 

Similarly, with a more differentiate hexamer scheme, (abcabc), and with 
the assembling rules allowing only associations of p + a and b + c, we get with 
a 100% probability the T = 4 capsid, with x = 2/7, as shown in Fig. 
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Figure 4: The building formula for the T=3 capsid 




Figure 5: The building formula for the T=4 capsid 

Note that in both cases we show only one of the "basic triangles" forming 
the capsid, which is always made with 20 identical triangles sticking together 
to form a perfect icosahedral shape. 

These examples suggest that strict association rules may exist providing 
precise agglomeration pathways for each kind of icosahedral capsid. Let us 
analyze these rules in more detail. 

3 Combinatorics of icosahedral capsids growth 

The virus capsid growth differs essentially from the fullerene agglomeration. 
Taking into account the complexity of interactions between various proteins 
forming hexamers and pentamers, the Boltzmann factors resulting from the 
energy barriers should reduce to simple dichotomy: certain agglomerations 
are allowed, whereas some others (like binding two pentamers together) are 



7 



just forbidden. In contrast, the Boltzmann factors resulting from energy 
barriers between the allowed processes should not be very different at tem- 
peratures between 20° and 37° C. 

What remains then are pure statistical factors, which must play the deci- 
sive role in order to ensure that the "correct" configurations are produced at 
each consecutive step almost without exception, i.e. practically with a 100% 
yield. Let us show now how these statistical factors can be evaluated, and 
what constraints they imply on the capsomer structure. 

From symmetry considerations (and confirmed by chemical analysis) it 
results that the pentamers are composed from five identical dimers, so that 
their five edges are perfectly equivalent, and that they possess a defined 
orientation, i.e. it is known which one of the two faces will be on the outer 
side of the capsid. All the five sides of a pentamer should be equivalent 
(identical), because 5 is a prime number, and any division into parts will 
break the symmetry. Concerning the hexamers, as 6 is divisible by 2 and 3, 
one can have the following four situations: 

- All 6 sides equivalent, (aaaaaa) 

- Two types of sides, disposed as (ababab) 

- Three types of sides, disposed as (abcabc) 

- Six different sides, (abcdef) 

The hexamers are also oriented, with one face becoming external, and 
the other one turned to the interior of the capsid. The three differentiated 
hexamers are represented in Figure El below. 




Figure 6: Three differentiations of hexamers 

If the viruses were using undifferentiated hexamers with all their sides 
equivalent, then there would be no reason for not creating any kind of struc- 
tures as shown in Fig. 01 , and the final yield would be very low (at best 
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like in the fullerenes, less than 10%). But with differentiated hexamers of 
the (ababab) — type simple selection rules excluding the (p — b) and (ab) as- 
sociations while letting the creation of (p — a) and of (b — b) links, we have 
seen that the issue becomes determined with practically 100% certainty, as 
it follows from the Fig. 0] . 

These sticking rules can be summarized up in a table that we shall call 
the "affinity matrix" , displayed in the Table 1 below: 
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p 





1 





a 


1 








b 








1 



Table I : Affinity matrix for the T = 3 capsid construction 

Here a "0" is put at the crossing of two symbols whose agglomeration is 
forbidden, and a "1" when the agglomeration is allowed. By construction, a 
"1" can occur only once any line or in any column. The next case presents 
itself when one uses the next hexamer type, with a two-fold symmetry : 
(abcabc). Again, supposing that only a-sides can stick to pentamers' sides p, 
there is no other choice but the one presented in Fig. 03 The corresponding 
affinity matrix is as follows: 
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Table II: Affinity matrix for the T = 4 capsid construction 

Finally, let us use the most highly differentiated hexamers of the (abcdef)- 
type. Starting with pentamers surrounded by the hexamers sticking via the 
(p — a)-pairing, we discover that now two choices are possible, leading to left- 
and right-hand sided versions, as shown in the following Fig. [7| 

The corresponding affinity matrices are given below: 
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Figure 7: The building formulae for the T=7 capsids; left and right 
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Table III (a): Affinity matrix for the T = 7 (left) capsid construction 
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Table III (b): Affinity matrix for the T = 7 (right) capsid construction 
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Now a natural question can be asked: what comes next ? In order to grow 
capsids with T-numbers greater than 7, one has to introduce new types of 
hexamers that would never stick to pentamers, but being able to associate 
themselves with certain sides of the former maximally differentiated hexam- 
ers. The result is shown in the Fig. |H1 below: 




Figure 8: The building formula for the T=9 capsid 
The corresponding affinity table is given below: 
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Table IV: Affinity matrix for the T = 9 capsid construction 

For bigger capsids, in which the rate of pentamers is lower, one can not 
obtain proper probabilities unless more than one type of hexamers is present, 
out of which only one is allowed to agglomerate with pentamers. In the case 
of two different hexamer types one obtains either the T = 9 capsid, or, with 
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more exclusive sticking rules, the T = 12 capsid. Finally, in order to get the 
T = 25 adenovirus capsid, one must introduce no less than four hexamer 
types, out of which only one type can agglomerate with pentamers. 




The affinity matrix for T = 12 capsid is as follows: 
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Table V: Affinity matrix for the T = 12 capsid construction 
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Now we can organize all these results in a single table that follows. To each 
value of triangular number T corresponds a unique partition into 1 + (T — 1), 
where the "1" represents the unique pentamer type and (T — 1) is partitioned 
into a sum of certain number of different hexamer types, according to the 
formula 

(T — 1) = «6 + /33 + 7 2 
with non-negative integers a,/3 and 7. 



Type (p,q) 


T = p 2 +pq + q 2 


N 6 = 10(T - 1) 


T decomposition 


(1,1) 


3 


20 


1 + 2 


(2,0) 


4 


30 


1 + 3 


(2,1) 
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60 


1 + 6 


(3,0) 


9 


80 


1 + 6 + 2 


(2,2) 


12 


110 


1+6+2+3 


(3,1) 


13 


120 


1 + 6 + 6 


(4,0) 


16 


150 


1+6+6+3 


(3,2) 


19 


180 


1+6+6+6 


(4,1) 


21 


200 


1+6+6+6+2 


(5,0) 


25 


240 


1 + (4 x 6) 


(3,3) 


27 


260 


1 + (4 x 6) + 2 


(4,2) 


28 


270 


1 + (4 x 6) + 3 


(5,1) 


31 


300 


1 + (5 x 6) 


(6,0) 


36 


350 


1 + (5 x 6) + 2 + 3 



Table VI: Classification of icosahedral capsids. The last column gives 
the number and type of hexamers needed for the construction 
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The inspection of Table VI leads to the following simple rules: 

1) For the construction of a capsid with given triangular number T one 
needs exactly T different proteins (or at least, T different types of sticking 
sides) - because the affinity matrix has the dimension T x T, as easily seen 
in the examples. 

2) By definition, the "affinity matrices" have only one non-zero item 
in each row and in each column; moreover, they are symmetric. This means 
that all capsid protein types (or more exactly, all different sticking sides) 
encountered in a complete icosahedron appear with the same frequency: 60 
times each. This can be most easily seen for the p-type forming a pentamer: 
there are 12 of them in each pentamer, and there are 12 pentamers in any 
icosahedral capsid. But then each p sticks to an a, and exclusively to it: 
therefore, there must be also 12 a-type proteins in the complete capsid, 
and so on, for each different protein. This means that all the subunits that 
assemble in pentamers and hexamers later on have to be produced at exactly 
the same rate in order to optimize capsid production. 

3) The capsomers composing a given capsid should be produced at 
different rates, with a very simple rule: for every dozen of pentamers, one 
should have 60 maximally diversified hexamers of the type (abode f), (and of 
each different type, like the (n a nbn c ndn e nf) in the T = 9 capsid); then 30 
hexamers of each (ababab) type; and 20 hexamers of the (abcabc) type. This 
rule can be easily seen upon inspection of Figures 4 — 8. 

4) In order to know how many (and of which kinds) hexamers should be 
used, the T-number should be partitioned into 1+ the rest, the " 1" staying for 
the unique type of pentamers' side, while "the rest" must be decomposed into 
a sum of numbers 2, 3 or 6, according to the simple factors of 6. This is shown 
in the last column of Table I : we see that T = 3 = l + 2; T = 4 = l + 3; 
then the next cases decomposes asT = 7= l + 6; T = 9 = l + 6 + 2 
(by the way, with this scheme in mind there is no point in trying to build a 
capsid with 1 + 6 + 3 = 10 because 10 cannot be found among the triangular 
numbers !). 

5) There is a clear "evolutionary" pattern in the last table - meaning 
that every next (bigger) type of capsid uses the previous construction, just 
adding a minimal amount of novelty: and it is clear most of the time which 
kind of new hexamer one must add, just looking at the differences between 
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the consecutive T-numbers - e.g. if they differ by 2 or by 3, one should add 
one new hexamer type, ababab or abcabc, respectively; but if they differ by 
4 (e.g. from T=21 to T=25) or by 5 (from T=31 to T=36); one must add 
two new types of {ababab), or one {ababab) and one {abcabc) type. 

6) Finally, one should apply these reasonings also to capsids that are 
not built with classical hexamers and pentamers. One may introduce a "dual" 
picture in which not the sides, but the vertices of capsomers correspond to 
real proteins' extremities. The examples of the alternative realisation of 
T = 3 and T = 4 capsids with pentamers and dimers only are shown in the 
Fig. EH below. 




Figure 10: Alternative realizations of T = 3 and T = 4 capsids One can observe 
how the 2- and 3-fold symmetry axis do appear. 

It can be easily deduced from these figures that the (p — a)-dimers com- 
posing pentamers occur 60 times in each capsid, whereas the {b — 6)-dimers 
occur only 30 times in a complete T = 3 capsid, but the {b — c)-dimers occur 
60 times in each T = 4 capsid. In some cases in a T = 4 capsid the triplets of 
{b — c)-dimers are replaced either by hexamers (then we obtain again a T = 3 
capsid), or by star-like trimers which will then occur 20 times. Whatever the 
decoration, each different letter symbol occurs with the same frequency, i.e. 
60 times in each capsid independently of the value of T. This suggests that 
all dimer proteins are produced at the same rate, and the differentiation pro- 
cess that leads to exclusion rules for subsequent agglomeration occurs later 
on. These realizations of capsid structure are akin to the decoration rules for 
curved Penrose-like tilings introduced by R. Twarock ^3] 
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4 Conclusion: how to hinder capsid produc- 
tion ? 



Capsids are a vital part of viral life cycle, protecting its most essential and 
the most vulnerable part, which is their genetic code. We need to consider 
that viruses can have DNA or RNA genomes and the latter are especially 
sensitive to degradation (by nucleases). In other words, without the capsid, 
the genomes exposed to the hostile external world would be destroyed in a 
couple of hours by the ultraviolet radiation, or by chemical attacks of SO2, 
ozone 3 , N0 2 , etc. The efficient protection provided by capsid shell ensures 
longevity and makes possible long travels from one host to another. Should 
we know how to hinder capsid assembly, tis information would have enormous 
potential benefit for the development of antiviral therapies. 

The existence of a well-defined agglomeration scheme during the capsid 
production suggests many possibilities of destructive intervention. As a mat- 
ter of fact, the more complicated the system and the more intricate the laws 
of its functioning, the more there are ways of hindering it. Let us consider a 
few examples of how to decrease the efficiency of capsid building. 

1) Create seven-sided polygons - heptamers - with the same proteins 
that usually form pentamers. Such a unit will be also surrounded by hex- 
amers, creating local negative curvature, thus enhancing the proper capsid 
production. Such an achievement is quite unlikely becaus of the steric hin- 
drance - there is just not enough place in order to pack seven proteins which 
usually go in packs by five. Even if there was such a possibility, the hep- 
tamers should be created inside the infected cells, or delivered there in some 
way, which is extremely difficult. 

2) A better way to hinder capsid production would be to produce hex- 
amers or pentamers with "wrong" proteins inserted, or in a wrong order - 
e.g. (abbaab) or (aaabbb) instead of (ababab) so that they could fit with one 
side, but then present a wrong protein to next capsomers trying to agglom- 
erate, thus destroying the symmetry and order of the construction. Again, 
this supposes the creation of modified RNA chains ordering the production 
of different hexamers, and again, the delivery problem seems very difficult to 
solve. 

3) A more natural way to hinder capsid production may be deduced 
from the probabilistic analysis of various agglomeration pathways. Each 
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capsid must contain exactly 12 pentamers and 10 (T — 1) hexamers of various 
kinds. In order to ensure the full use of all these building blocks, their initial 
ratios should be as close as possible to 12 : (10(T — 1)). If for some reasons 
an excess of hexamers were produced, entire capsids still would be completed 
leaving the extra hexamers unemployed. 

But the situation will radically change if an excess of pentamers could be 
created during capsid assembly phase inside an infected cell. The agglomer- 
ation starts around the pentamers, because the probability of a pentamer- 
hexamer association is much higher than that of a hexamer-hexamer asso- 
ciation PI]). What will happen now can be illustrated on a concrete 
example. Let us imagine a great number of "kits" with 12 pentamers and 
20 hexamers (ababab) each. As we know, with simple matching rules allow- 
ing associations (p + a) and (b + b) and forbidding the associations (p + b) 
and (a + a), a complete capsid can be constructed. Suppose now that many 
such "kits" have been dropped on the ground, and many people are trying 
simultaneously to build T = 3 capsids . The rules of the game being that 
once a person grabs a capsomer and sticks it to the partially built capsid, 
the capsomer can not be removed. After some time everybody will succeed 
in constructing an entire full capsid, with no extra capsomers left. 

Suppose now that someone had thrown in some extra hexamers. After a 
while some of the hexamers will be left out - but if there were 127V" pentamers 
and more than 20iV of hexamers, there will be still N full capsids completed 
at the end of the day. 

The situation would be totally different had someone poured in an excess 
of pentamers. Supposing that people grab capsomers one by one at random, 
there will be hardly one or two full capsids completed. 

This comes out from simple probability calculus. Let us start with the 
simplest example, 20 hexamers {ababab) and 13 pentamers (ppppp). Of course, 
if the things were not happening at random, but in an organized way, one 
can construct a complete T = 3 capsid and leave the extra pentamer alone. 
There are exactly 13 ways of doing it - just taking the decision which one 
among the 13 pentamers should be chosen to be dropped out. But if the 
agglomeration happens at random, there is no reason that one of the pen- 
tamers should wait until the 12 others are incorporated in a capsid. People 
acting randomly would rather pick up as many hexamers as they can; and 
there is an enormous number of ways of doing it (with all possible permu- 
tations taken into account), so the probability of this to happen is orders of 
magnitude higher than the probability of the happy issue described above. 
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For example, there are 13 x 20 different ways of producing two incomplete 
capsids: a pentamer-hexamer doublet and an incomplete T = 3 capsid with 
one hexamer missing; and there are 13 x 20 x 19 different ways of producing 
a 5 — 6 — 6 agglomerate and an incomplete capsid with two hexamers missing, 
and so on. 

This means that as a result, we shall have in the best case two uncomplete 
capsids, and most probably even a higher number of incomplete, unfinished 
structures. 

This is what would happen to people trying to pick up spare parts at a 
car cemetery, full of naked car frames and spare wheels, and to make up a 
complete car. "Complete" meaning four wheels in one car - with three wheels 
only no car will ever roll out. But if the number of wheels is not 4 times N 
(N being the number of car frames with no wheels), but only half of it, say, 
then what would most probably happen is that almost everybody would end 
up with a car with 3 or 2 wheels, and almost nobody with a complete car. 

In other words, if we could incite some cells to produce exclusively pen- 
tamers of the virus by which it is attacked, - this has to happen inside the 
cell, where the capsids are being produced - then there will be an excess of 
pentamers, and almost no complete capsid will be produced. And with no 
complete capsids viruses will be much less obnoxious. 

To implement such a scenario is certainly a challenge for molecular biology 
and nano-technology. 
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